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1 The Early Years 

1.1 Russell and Poincare 

The word ’predicative’ first appeared in Russell’s note On Some Difficulties in 
the Theory of Transfinite Numbers and Order Types [Rus07]. Paradoxes such 
as Russell’s Paradox show that we cannot form the class {x \ 4>{x)} for all 
propositional functions 4>(x). Russell proposed we call <j>{x) predicative if it 
defines a class and non-predicative otherwise, but did not offer a criterion by 
which we could decide which propositional functions are which. 

A first such criterion was offered by Poincare in the third part of his paper Les 
Mathematiques et la Logique [Poi06]. He proposed the vicious circle principle: 
“The definitions which ought to be regarded as non-predicative are those which 
contain a vicious circle.” (p. 1063) He indicated by example what he meant by 
‘vicious circle’: in both the Richard paradox and the Burali-Forti paradox, we 
define an aggregate E, and make use of E within its own definition. 

Poincare proposed that definitions involving such a ‘vicious circle’ are il¬ 
legitimate: “A definition containing a vicious circle defines nothing.” [Poi06, 
p. 1065] 

Poincare justified this as follows. For a definition to be legitimate, it must be 
possible to substitute the definiens for the defined term. Recursive definitions 
are acceptable, as this substitution will always terminate. But in the case of a 
definition involving a vicious circle, the substitution will never terminate. 

Russell agreed with Poincare, and offered a more formal version of the 
vicious-circle principle in Les Paradoxes de la Logique [Rus06]: 

Whatever involves an apparent variable must not be among the pos¬ 
sible values of that variable. 

(An ‘apparent variable’ is a bound variable in modern terminology.) In a few 
years, he would offer a system of logic that adhered to this vicious-circle prin¬ 
ciple, the theory of types, first appearing in the paper Mathematical Logic as 
Based on the Theory of Types [Rus08], and later forming the basis of White- 
head and Russell’s Principia Mathematica [WR27]. An historical account of the 
theory of types is presented in [KLN02]. 


1 



1.2 Sets and Orders 

The types in Russell’s theory include the following. There is a type of individ¬ 
uals ; and, given any types A and Bi, ..., B ni we may form the type of sets 
{x | < f>(x )}, where x ranges over A and the quantifiers in 4>{x) range over the 
types B i, ..., B n only. This type is said to be higher than the types B±, ..., 
B n . Thus, in defining an object of a given type, one may only quantify over 
lower types. In particular, one may not quantify over a type C when defining 
an object of type C. 

We give names to some of these types. The Oth-order sets of individuals, 
or arithmetic sets 1 , are those that can be defined as {x \ 4>{x)}, where all the 
quantifiers in <p(x) range over individuals only. The lst-order sets are those that 
can be defined using quantifiers over numbers and quantifiers over Oth-order 
sets. The 2nd-order sets are those that can be defined using quantification over 
numbers, Oth-order sets and lst-order sets; and so forth. 2 

The ramification of sets into orders has some undesirable consequences. If 
we construct the real numbers as sets of rationals, we find ourselves with many 
different sets of reals: the Oth-order reals, the lst-order reals, and so forth. The 
completeness property does not hold for any of these sets of reals; the least 
upper bound of a bounded set of nth-order reals is, in general, a real of order 
n + 1. It is impractical to carry out real analysis in such a setting. Similar 
problems can be expected to arise in many other branches of mathematics. 

Russell’s solution, introduced in [R.us08] and used in Pricipia Mathematica 
[WR27] (with which, however, Whitehead and Russell were not satisfied) was 
to introduce the Axiom of Reducibility. With respect to the types introduced 
above, the axiom reads: for any nth-order set S, there exists a Oth-order set 
T with the same members as S. This axiom effectively allows one to perform 
impredicative definitions in the theory of types. 

Weyl’s solution to this same problem in Das Kontinuum [Weyl8] was to 
remove all orders above the Oth; to see how much of mathematics we can re¬ 
construct while restricting ourselves to the arithmetic sets. Many theorems of 
mathematics cannot be proved in this system, but often a weaker theorem can 
be proved which is almost as useful. For example, the completeness property of 
the reals does not hold in this system, but two weaker principles do hold: that 
every bounded set of rationals has a real least upper bound, and that every 
bounded sequence of reals has a least upper bound. Weyl showed that these 
principles were sufficient for many of the purposes for which the completeness 
property is used. 

1 Russell called these the predicative classes. We avoid this terminology, as it is a very 
different meaning of the word to the one we are considering in this article. Weyl called these 
the finite sets [Weyl8]. 

2 Strictly, the theory of types deals with types of propositional functions (predicates), not 
of sets. Statements involving the set {x \ rf>(x)} are seen as shorthand for statements involving 
the propositional function <f>(x). The theory of types also included types of propositions, and 
of binary propositional functions, ternary propositional functions, and so forth. 
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2 Predicativity Modulo the Natural Numbers 

2.1 Wang-Kreisel Predicativity 

Instead of restricting ourselves to order 0, we can pursue the opposite approach 
and introduce transfinite orders. Godel [G44] suggested constructing the sets of 
order a, where a ranges the ordinals. For each successor ordinal a + 1, the sets 
of order a +1 are those that can be defined by quantifying over the sets of order 
a. For each limit ordinal A, the collection of sets of order A is the union of the 
collections of sets of order a for a < A. This idea was later named the ramified 
analytic hierarchy by Feferman [Fef05]. Following Feferman’s notation, let us 
denote by R a the set of all sets of order a. 

Godel showed that we can prove impredicatively that there exists an ordinal 
a such that every set in the ramified hierarchy is coextensional with a set of 
order a. Thus, we could pursue classical mathematics, making use of sets of 
order a alone. However, the ordinal a cannot be defined predicatively. 

Lorenzen [Lor55] (according to Feferman [Fef05]) and Wang [Wan54] sug¬ 
gested restricting the ramified hierarchy to the recursive ordinals. To formalize 
this idea, Wang introduced a sequence of formal systems £ a , where a ranges 
over the recursive ordinals. The intended interpretation of £ a comprises the 
natural numbers and the sets of order < a. 

If we extend the hierarchy over all the recursive ordinals, Kleene [Kle59] 
showed that the sets obtained are exactly the hyperarithmetic sets; that is, those 
sets definable by both a n(-formula and a £}-formula. Spector [Spe55] showed 
that the hyperarithmetic well-orderings are exactly the recursive well-orderings. 
In 1960, Kreisel [Kre60a] argued that these and other results suggest that the 
predicatively definable sets should be identified with the hyperarithmetic sets. 

We may argue for this conclusion as follows 3 . Consider a predicativist, who 
may only construct a set if it is defined using a definiton that involves only 
previously constructed sets. If he has constructed a well-ordering of order type 
a , he may use transifinite recursion over this well-ordering to define the set 
R a , and so can construct all the sets of order < a. The predicativist begins 
with the arithmetic sets. For any recursive ordinal a, there is an arithmetic 
well-ordering of order type a. Thus, the predicativist may construct all the 
sets R a for a recursive, and hence all the hyperarithmetic sets. These sets do 
not include any well-orderings with non-recursive ordinal, and so the procedure 
comes to a halt. The subsets of N the predicativist has been able to construct 
are exactly the hyperarithmetic sets. 

The following is the important premise of this argument. For reference, we 
shall refer to it as Wang-Kreisel’s principle: 

If a well-ordering W of order type a can be predicatively defined, 
then the sets of order < a are predicatively definable. 

’’This argument is not Kreisel’s. It is similar to the argument given in Feferman [Fef05]. 
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2.2 Feferman-Schiitte Predicativity 

A few years before his 1960 paper, Kreisel [Kre60b] suggested modifying Wang’s 
heirarchy as follows. It is not enough that the well-ordering IF can be defined 
predicatively; the predicativist must also be able to prove that W is a well¬ 
ordering. Only then is the system S Q predicatively acceptable. However, he 
wrote “I have no information about the least ordinal not obtained by these 
extensions starting from Z." 

While Kreisel had abandoned this proposal by 1960 Check this, it was 
taken up independently in 1964-5 by Feferman [Fef64] and Schiitte [Sch65], 
who showed that the ordinals obtained by this process are exactly those below 
r 0 , the least ordinal such that Xr o (0) = r 0 , where Xa(fi) is the Veblen critical 
function. 

We record the new principle as Feferman-Schiitte’s principle: 

If a well-ordering IF of order type a can be predicatively defined and 
predicatively proven to be a well-ordering, then the sets of order < a 
are predicatively definable. 

Given a hierarchy of systems S a , Feferman defined the autonomous ordinals 
with respect to this hierarchy to be the least initial segment of the ordinals that 
contains u> and is closed under addition and the clause: 

If a is autonomous and S a proves that there exists a well-ordering 
of length 0, then j3 is autonomous. 

In [Fef64], Feferman gave two hierarchies of systems intended to characterise 
predicative reasoning and showed that, for either hierarchy, the autonomous 
ordinals are exactly those below r 0 . Schiitte independently gave another, with 
the same conclusion. Check the differences between Wang’s, Kreisel’s, 
Feferman’s and Schutte’s systems. This was held to be strong justification 
for identifying the predicatively definable sets with the sets of order < To- One 
consequence of this is that the systems with proof-theoretic ordinal < To are 
exactly those systems that can be predicatively proven to be consistent. 

2.3 Weaver’s Objection 

Since the 1960s, the logical community has generally accepted Feferman and 
Schutte’s criterion as the definition of predicativity. Recently, however, Weaver 
[Wea05] has argued that much larger ordinals that To should be considered 
predicatively acceptable. He has raised two objections to Feferman’s argument 
above. 

Firstly, the argument above uses the principle that, if we have proved IF 
to be a well-order, then we may perform transfinite induction over IF. This 
inference is justified classically, but not necessarily predicatively. In particular, 
proving IF is a well-order allows one to conclude 

Va :(\/y((y, x)£W->y£X)-^x£X)^> \/xx € X 
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for any set X , but not 


Vx(Vy((y,x) £ W —> </>(y)) -> <j>(x)) -> Vx^(;r) 

for any formula 4>{x), which is what is required to construct R a . 

Weaver thus proposes replacing Feferman-Schiitte’s principle with the fol¬ 
lowing, which we shall call Weaver’s principle: 

If a well-ordering W of order type a can be predicatively defined 
and we can predicatively prove all formulas of the following schema: 

\/x(\/y((y,x) £ W -> (j){y)) -> <j>(x)) -> Vx^(;r) 

then the sets of order < a are predicatively definable. 

By itself, this modification would lead to a smaller class of ordinals being 
predicatively acceptable. However, Weaver also suggests a second principle 
Let us say that a formal system is predicatively acceptable if all its theorems 
are predicatively provable. Then Feferman-Schiitte’s principle can be rewritten 
as 


If £ q is predicatively acceptable and S Q b I((3), then S^ is predica¬ 
tively acceptable. 


or as 


(A) If I(/3) is predicatively provable and S^ b (j> , then (f> is predica¬ 
tively provable. 

Weaver suggests that the same principles that justify this reasoning also justify 
the schema 

(B) For any formula 4>, the formula I{x) — > P{x, r (jP) —> <f> is pred¬ 
icatively provable. 

He shows that adding this schema to So allows us to prove well-orderings of 
order type To and higher exist. 

Weaver also presents his own hierarchy of systems. He considers the oper¬ 
ation of extending a formal system with a truth predicate ; for any system S, 
the theory Tarski(S) extends S with a predicate Tx, intended to express “x is 
the Godel number of a formula that is true in every model of S.” The extra 
axioms of Tarski(S) express the conditions: the axioms of S are true; the rules 
of deduction of S preserve truth; if <j>(n) is true for every natural number n then 
\/x<fi(x) is true. 

He defines a way of transifinetly iterating the operation ‘Tarski’, and pro¬ 
poses: if a predicativist accepts the system S, and S proves that transifinite 
recursion over a is possible, then the predicativist should accept the result 
of applying Tarski to S a times. Check the details here. He presents a 
system that is predicatively acceptable by this principle, which can prove the 
existence of ordinals much larger than To. The upper limit on the ordinals that 
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can be proven to exist in this system is at least (j>n^( 0); the precise upper limit 
is unknown. 

Against this, Friedman (‘Weaver’s error?’ on the FOM mailing list, 10 Apr 
2006) has argued that, given (A), we need not accept (B). There is a difference 
between claiming ‘If a predicativist accepts /(/3), and ^ is a theorem of 
then there is a way for the predicativist to come to accept </>’ and claiming ‘A 
predicativist will accept the implication: if I{(3) and S^ b </> then (f>\ Feferman 
claims the first, but not the second. 

Weaver replied (‘Friedman’s confusion?’ on the FOM mailing list, 11 Apr 
2006) that he accepts that (A) does not entail (B); nevertheless, it seems ‘im¬ 
plausible under any conception of predicativism’ that every instance of (A) 
should be true and (B) not true. If we can see by some means that (A) is true, a 
predicativist should be able to do the same; and if a predicativist can accept (A), 
then (B) is true. Feferman (http://www.math.wustl.edu/~nweaver/responsel.html) 
replied (to a more general form of this argument), saying that it is possible that 
a predicativist might accept each instance of (A) by a different reasoning proce¬ 
dure, not by a uniform method that would lead to him accepting the implication 
in (B). Weaver (http://www.math.wustl.edu/~nweaver/response2.html) replied 
that the means by which we come to recognise (A) should also be available to 
the predicativist. 

The argument raised with each side claiming the burden of proof lay with the 
other: Weaver’s critics insisting that (A) does not entail (B), Weaver demanding 
a form of justification for (A) that does not also justify (B). 

3 Inductive Definitions 

Alongside the above historical development of the concept of predicativity, there 
has been a debate on whether inductive definitions should be considered as 
predicative definitions. Consider the inductive definition of the set of natural 
numbers: 

1. 0 is a natural number. 

2. The successor of any natural number is a natural number. 

3. The natural numbers are exactly those objects that can be constructed by 
the first two clauses. 

In his theory of types, Russell did not allow himself to introduce the natural 
numbers by an inductive definition, nor to take them as given; instead, he de¬ 
fined them. Having defined the concepts of cardinal number, zero and successor, 
he defined ‘n is a natural number’ to mean 

for all sets X, if 0 £ X and \/x(x € X —> x + 1 £ X), then n £ X. 

As it stands, this is an impredicative definition. If we recast it so X only ranges 
over sets of order a for some a, then we will have Oth-order natural numbers, 
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lst-order natural numbers, and so forth; a similar problem to the one we faced 
with real numbers in Section 1.2 above. Whitehead and Russell’s solution was 
to use the Axiom of Reducibility. 

Wang [Wan59] agreed that, when an inductive definition is converted into 
an explicit definition by this method, the result is an impredicative definition. 
However, he argued that an inductive definition can be read as an ‘implicit 
definition’, in which case it is unobjectionable. He introduced a set of axioms 
for the predicative sets, which included: 

(7) An inductive definition containing explicitly only predicative notions de¬ 
fines a predicative set. 

(8) Given any predicative ordinal, the result of iterating an inductive defini¬ 
tion scheme as many times as the ordinal yields a predicative set. 

Wang states that, as a consequence of adopting these principles, the predicative 
sets of natural numbers are exactly the hyperarithmetic sets. 

Lorenzen and Myhill [LM59] also argued that inductive definitions are ac¬ 
ceptable to the predicativist, because they do not make use of the actual infinite. 

Dummett [Dum63] disagreed, arguing that 

[T]he notion of ‘natural number’, even as characterised by the for¬ 
mal system, is impredicative. The totality of natural numbers is 
characterised as one for which induction is valid with respect to any 
well-defined property, where by a ‘well-defined property’ is under¬ 
stood one which is well defined relative to the totality of natural 
numbers. 

Thus, however we interpret the inductive definition of N, the interpretation 
must involve the validity of proving by induction a statement that involves 
quantification over N, and this is an impredicativity. 

Kreisel [Kre65, Kre60a] made the intriguing suggestion that inductive defi¬ 
nitions may be impredicative when using classical logic, but may not be when 
using intuitionistic logic. He justified this by presenting an abstract theory of 
constructions, some of which are to be singled out as proofs. From the induc¬ 
tive definition of the natural numbers, we know what it means to prove that 
a given construction is a natural number. The validity of proof by induction 
rests in the fact that, whenever we are given a property P, proofs of P(0) and 
P(a) —y P(a + 1), and a natural number a, we can construct a proof of P(a). 
This last statement is justified by the theory of constructions, and not by reading 
the inductive definition as quantifying over all properties P. 

More recently, Parsons [Par92] has argued that the word ‘predicative’ has 
been used with two different meanings historically. In the modern sense, sets are 
‘impredicatively defined if they are given by abstracts involving quantification 
over some totality of sets to which they themselves belong.’ In this sense, 
inductive definitions are impredicative; Parsons’ argument is very similar to 
Feferman’s. However, Parsons argues Poincare and Lorenzen and Myhill used 
the word in a weaker sense: for them, sets and classes should be viewed as the 
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extensions of predicates, and a definition is impredicative if it defines a set using 
a predicate that refers to the set itself. In this sense, inductive definitions are 
predicative, as Lorenzen and Myhill argued. 


4 Predicativity in Type Theory 

In general, the type theory community has confined its interest in predicativity 
to the question of whether a universe is predicative. 

Roughly, a universe U is a type of types. We say U is impredicative if some 
of the types in U are constructed using U itself; for example, if IIx : U.x has 
type U. Otherwise, U is predicative. A type theory is called predicative if it 
contains no impredicative universes. 

Martin-Lof’s original type theory [ML71] used a single universe V, which was 
highly impredicative as the system used the typing rule V € V. After Girard 
[Gir72] showed that the system was inconsistent, Martin-Lof [ML75] modified 
the system so that the universe V is predicative. He called the objects in V the 
small types, and the other types the large types. This system has been referred 
to (e.g. by Aczel and Gambino [AG02]) as ML\. 

Later [ML82] Martin-Lof extended the system again to include W-types, 
types of well-ordered trees, and an infinite series of predicative universes Vq, V\, 
V‘ 2 , ..., with Vo € V\ € V 2 € ■ ■ ■ We shall refer to the systems with n universes, 
with and without W-types, as ML n and ML n W respectively, and the systems 
with infinitely many universes as ML and MLW. 

Each of the types in ML is a simple inductive definition, and thus the 
system is predicative modulo the natural numbers in Feferman and Schiitte’s 
sense. It was proven independently by Feferman [Fef82] and Aczel 4 that the 
proof-theoretic strength of ML is To- Therefore, one can build a model of 
ML without using only constructions that are predicative modulo the natural 
numbers, and the ordinals that can be proven to exist in the system are exactly 
those less than r 0 . 

The definition of W-types is not a simple inductive definition, but a gener¬ 
alised inductive definition. Including W-types increases the strength of a type 
theory dramatically. Setzer [Set93] has shown that even MLiW has a proof- 
theoretic ordinal that is much larger than To- If we agree with Parsons that there 
are two senses of the word ’predicative’, then type theories with W-types are 
predicative in Lorenzen and Myhill’s sense, but not in Feferman and Schiitte’s 
sense. 

It is worth noting that consistent impredicative type theories have also been 
employed. For example, the Calculus of Constructions [CH88] uses a single 
impredicative universe, while ECC [Luo94] uses an impredicative universe below 
an infinite series of predicative universes. 


4 Palmgren [Pal98] claims without citation that Aczel made this discovery. 



A Proof-Theoretic Ordinals of Type Theories 

Ordinal Definitions The Veblen functions (j> a (0) for all ordinals a, (3 are 
defined as follows: 

• 4>o{0) = ^ 

• For a > 0, the function (f> a enumerates the ordinals that are fixed points 
of <j)p for all (3 < a. 

Define the sequence of ordinals 7 ,, for n a natural number by 

7o = e o 
7ra+1 07n(O) 


Let T 0 = sup n 6 N 7 „. 

Let I be the least weakly inaccessible cardinal (i.e. the least regular cardinal 
such that H/ = /), and let I n be the nth weakly inaccessible cardinal. For every 
regular cardinal k > oj, define the ordinal ip K (ot) and the set of ordinals C{a,{3) 
simultaneously by transfinite recursion on a as follows: 

• ip K (a) is the least (3 such that k £ C(a, (3) and C(a, (3) n k C (3 (i.e. there 
is no A € C(a, (3) such that (3 < A < k). 

• C(a,(3 ) is the closure of (3 U {0,1} under the operations +, <j>, A 1 —> 

and ( 7 r, £) 1 —> where 7r is a regular cardinal > u> and £ < a. 

We now list the proof-theoretic ordinal of several type theories, in ascending 
order. 


Type Theory 

Ordinal 

Proof 

ML 1 

7i 

[Acz77] 

ML n 

7 n 

[Fef82] 

ML 

Lo 

[Fef82], Aczel citation? 

M L-| W 

V^i (Nj+w) 

[Set93] 

ML n W 

V’Ki (N/„+w) 

[Set98] conjectured, not proved? 

MLW 

(XlJ 

[Set98] conjectured, not proved? 


It would be interesting to find out how (f> n^(0) and (N/„) compare. 
Check Setzer’s latest papers 
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